Frequency distributions of uniphones, diphones, and triphones in spontaneous speech.

نویسندگان

Victor Kuperman

Mirjam Ernestus

Harald Baayen

چکیده

This paper explores the relationship between the acoustic duration of phonemic sequences and their frequencies of occurrence. The data were obtained from large (sub)corpora of spontaneous speech in Dutch, English, German, and Italian. Acoustic duration of an n-phone is shown to codetermine the n-phone's frequency of use, such that languages preferentially use diphones and triphones that are neither very long nor very short. The observed distributions are well approximated by a theoretical function that quantifies the concurrent action of the self-regulatory processes of minimization of articulatory effort and minimization of perception effort.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Creation of unseen triphones from seen triphones, diphones and phones

With limited training data, infrequent triphone models for speech recognition will not be observed in suficient number. In this report, a speech production approach is used to predict the characteristics of unseen triphones by using a transformation technique in the parametric representation of a formant speech synthesiser. Two techniques are currently tested. In one approach, unseen triphones ...

متن کامل

Creation of unseen triphones from diphones and monophones using a speech production approach

With limited training data, infrequent triphone models for speech recognition will not be observed in sufficient number. In this report, a speech production approach is used to predict the characteristics of unseen triphones by concatenating diphones and/or monophones in the parametric representation of a formant speech synthesiser. The parameter trajectories are estimated by interpolation betw...

متن کامل

Predicting Unseen Triphones with Senones - Speech and Audio Processing, IEEE Transactions on

In large-vocabulary speech recognition, we often encounter triphones that are not covered in the training data. These unseen triphones are usually backed off to their corresponding diphones or context-independent phones, which contain less context yet have plenty of training examples. In this paper, we propose to use decision-tree-based senones to generate needed senonic baseforms for these uns...

متن کامل

Predicting unseen triphones with senones

In large-vocabulary speech recognition, the decoder often encounters triphones that are not covered in the training data. These unseen triphones are usually represented by corresponding diphones or context independent monophones. We propose to use decision-tree based senones to generate needed senonic baseforms for unseen triphones. A decision tree is built for each individual Markov state of e...

متن کامل

Training production parameters of context-dependent phones for speech recognition

A representation form of acoustic information in a trained phone library at the production parametric as well as the spectral level is described. The phones are trained in the parametric domain and are transformed to the spectral domain by means of a synthesis procedure. By this twofold description, potentially more powerful procedures for speaker adaptation and generation of unseen triphones c...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

The Journal of the Acoustical Society of America

دوره 124 6 شماره

صفحات -

تاریخ انتشار 2008

Frequency distributions of uniphones, diphones, and triphones in spontaneous speech.

نویسندگان

چکیده

منابع مشابه

Creation of unseen triphones from seen triphones, diphones and phones

Creation of unseen triphones from diphones and monophones using a speech production approach

Predicting Unseen Triphones with Senones - Speech and Audio Processing, IEEE Transactions on

Predicting unseen triphones with senones

Training production parameters of context-dependent phones for speech recognition

عنوان ژورنال:

اشتراک گذاری